A Shallow Algorithm for Correcting Nesting Errors and Other Well-Formedness Violations in XML-like Input

Author

  • Christian Siefkes
Abstract

We argue that there are some special situations where it can be useful to repair well-formedness violations occurring in XML-like input, giving examples from our own work. We analyze the types of errors that can occur in XML-like input and present a shallow algorithm that fixes most of these errors, without requiring knowledge of a DTD or XML Schema.

Similar resources

An Incremental Correction Algorithm for XML Documents and Single Type Tree Grammars

XML documents represent an integral part of the contemporary Web. Unfortunately, a relatively high number of them is affected by well-formedness errors, structural invalidity or data inconsistencies. The purpose of this paper is to continue with our previous work on a correction model for invalid XML documents with respect to schemata in DTD and XML Schema languages. Contrary to other existing ...

Design and implementation of Persian spelling detection and correction system based on Semantic

The Persian language has special features (graphemes, homophones, and multi-shape clinging characters) on electronic devices. Furthermore, the design and implementation of NLP tools for Persian are more challenging than for other languages (e.g. English or German). Spelling tools are widely used for editing user texts such as emails and text in editors. Also developing Persian tools will provide Persian progr...

An approach to fault detection and correction in design of systems using of Turbo codes

We present an approach to the design of fault-tolerant computing systems. In this paper, a technique is employed that enables the combination of several codes in order to obtain flexibility in the design of error-correcting codes. Code combining techniques are very effective, and turbo codes are one example of them. The Algorithm-based fault tolerance techniques that to detect errors rely on the c...

An Approach to Increasing Reliability Using Syndrome Extension

Computational errors in numerical data processing may be detected efficiently by using parity values associated with real number codes, even when inherent round off errors are allowed in addition to failure disruptions. This paper examines correcting turbo codes by straightforward application of an algorithm derived for finite-field codes, modified to operate over any field. There are syndromes...

Integration of Syntactic, Semantic and Contextual Information in Processing Grammatically Ill-Formed Inputs

This paper describes an integrated method for processing grammatically ill-formed inputs. We use partial parses of the input for recovering from parsing failure. In order to select partial parses appropriate for error recovery, cost and reward are assigned to them. Cost and reward represent the badness and goodness of a partial parse, respectively. The most appropriate partial parse is selected ...

Publication date: 2004